1 Data and memory optimization techniques for embedded systems

P. R. Panda, F. Catthoor, N. D. Dutt, K. Danckaert, E. Brockmeyer, C. Kulkarni, A. Vandercappelle, P. G. Kjeldsb
April 2001 ACM Transactions on Design Automation of Electronic Systems (TODAES), volume 6 issue 2 



Full text available: pdf(339.91 KB)



Additional Information: full citation, abstract, references, citings, index terms



We present a survey of the state-of-the-art techniques used in performing data and memory-related optimiza 
systems. The optimizations are targeted directly or indirectly at the memory subsystem, and impact one or m 
important cost metrics: area, performance, and power dissipation of the resulting implementation. We first ex 
independent optimizations in the form of code transformations. We next cover a broad spectrum of optimiza

Keywords: DRAM, SRAM, address generation, allocation, architecture exploration, code transformation, data 
optimization, high-level synthesis, memory architecture customization, memory power dissipation, register fil 
survey 



2 Compiler transformations for high-performance computing
David F. Bacon, Susan L. Graham, Oliver J. Sharp 
December 1994 ACM Computing Surveys (CSUR), volume 26 issue 4 

Full text available: pdf  Additional Information: full citation, abstract, references, citings, index terms, review

In the last three decades a large number of compiler transformations for optimizing programs have been impl
optimizations for uniprocessors reduce the number of instructions executed by the program using transformat
analysis of scalar quantities and data-flow techniques. In contrast, optimizations for high-performance supers 
parallel processors maximize parallelism and memory locality with transformations that rely on tracking the p 

Keywords: compilation, dependence analysis, locality, multiprocessors, optimization, parallelism, superscala 
vectorization



3 The evolution of the Sperry Univac 1100 series: a history, analysis, and projection

B. R. Borgerson, M. L. Hanson, P. A. Hartley 

January 1978 Communications of the ACM, volume 21 issue 1 

Full text available: pdf(1.39 MB)  Additional Information: full citation, abstract, citings, index terms

The 1100 series systems are Sperry Univac's large-scale mainframe computer systems. Beginning with the 11 
series has progressed through a succession of eight compatible computer models to the latest system, the 11 
1977. The 1100 series hardware architecture is based on a 36-bit word, ones complement structure which ob
from storage and one from a high-speed register, or two operands from high-speed registers. The 1100 Oper 

Keywords: 1100 computer series, computer architecture, data management systems, end user facilities, exe
software, multiprocessing, multiprogramming, operating system, programming languages



4 Computing curricula 2001

September 2001 Journal on Educational Resources in Computing (JERIC)

Full text available: pdf  html(2.78 KB)  Additional Information: full citation, references, citings, index terms



5 Fast detection of communication patterns in distributed executions

Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies on Collaborativ 

Full text available: pdf(4.21 MB)  Additional Information: full citation, abstract, references, index terms

Understanding distributed applications is a tedious and difficult task. Visualizations based on process-time dia 
to obtain a better understanding of the execution of the application. The visualization tool we use is Poet, an e 
at the University of Waterloo. However, these diagrams are often very complex and do not provide the user w 
overview of the application. In our experience, such tools display repeated occurrences of non-trivial commun 

6 ABLE: A LISP-based layout modeling language ...
Gary B. Goates, Suhas S. Patil 

June 1981 Proceedings of the 18th conference on Design automation 

Full text available: pdf(692.32 KB)  Additional Information: full citation, abstract, references, index terms

ABLE, an array-based linguistic editor, is a layout modeling language for storage/logic arrays (SLA's) that is b
programming language. This paper describes ABLE's design, presents an ABLE layout program, and evaluates 
SLA-based circuit design. ABLE embodies a linguistic approach to computer-aided design (CAD) for very large 
(VLSI) circuits; digital system designers can represent SLA-based integrated circuits as relatively abstract and 

7 The family of concurrent logic programming languages
Ehud Shapiro 

September 1989 ACM Computing Surveys (CSUR), volume 21 issue 3

Full text available: pdf(9.62 MB)  Additional Information: full citation, abstract, references, citings, index terms

Concurrent logic languages are high-level programming languages for parallel and distributed systems that of 
both known and novel concurrent programming techniques. Being logic programming languages, they preserv 
the abstract logic programming model, including the logical reading of programs and computations, the conve 
data structures with logical terms and manipulating them using unification, and the amenability to metaprogr 

8 System-level power optimization: techniques and tools
Luca Benini, Giovanni de Micheli

April 2000 ACM Transactions on Design Automation of Electronic Systems (TODAES), volume 5 issue 2
Full text available: pdf  Additional Information: full citation, abstract, references, citings, index terms

This tutorial surveys design methods for energy-efficient system-level design. We consider electronic systems
hardware platform and software layers. We consider the three major constituents of hardware that consume e 
computation, communication, and storage units, and we review methods of reducing their energy consumptio 
models for analyzing the energy cost of software, and methods for energy-efficient software design and comp 



9 CASDAL: CASSM's DAta Language
Stanley Y. W. Su, Ahmed Emam 

March 1978 ACM Transactions on Database Systems (TODS), volume 3 issue 1

Full text available: pdf(2.72 MB)  Additional Information: full citation, abstract, references, citings, index terms

CASDAL is a high level data language designed and implemented for the database machine CASSM. The langu 
manipulation and maintenance of a database using an unnormalized (hierarchically structured) relational data 
facilities to define, modify, and maintain the data model definition. The uniqueness of CASDAL lies in its powe 
operations in terms of several new language constructs and its concepts of tagging or marking tuples and of m 
Keywords: associative memory, database, nonprocedural language, query language, relational model



10 Architecture of the IBM System/370

Richard P. Case, Andris Padegs 

January 1978 Communications of the ACM, volume 21 issue 1

Full text available: pdf(2.75 MB)  Additional Information: full citation, abstract, references, citings, index terms

This paper discusses the design considerations for the architectural extensions that distinguish System/370 fr 
comments on some experiences with the original objectives for System/360 and on the efforts to achieve the 
reasons and objectives for extending the architecture. It covers virtual storage, program control, data-manipu 
timing facilities, multiprocessing, debugging and monitoring, error handling, and input/output operations. ... 

Keywords: architecture, computer systems, error handling, instruction sets, virtual storage 



11 Draft Proposed: American National Standard - Graphical Kernel System

Technical Committee X3H3 - Computer Graphics 

February 1984 ACM SIGGRAPH Computer Graphics, volume 18 issue SI

Full text available: pdf  Additional Information: full citation



12 On randomization in sequential and distributed algorithms

Rajiv Gupta, Scott A. Smolka, Shaji Bhaskar 

March 1994 ACM Computing Surveys (CSUR), Volume 26 issue 1 

Full text available: pdf(8.01 MB)  Additional Information: full citation, abstract, references, citings, index terms

Probabilistic, or randomized, algorithms are fast becoming as commonplace as conventional deterministic algo 
presents five techniques that have been widely used in the design of randomized algorithms. These technique 
12 randomized algorithms—both sequential and distributed— that span a wide range of applications, including 
classical problem in number theory), interactive probabilistic proofs ... 

Keywords: Byzantine agreement, CSP, analysis of algorithms, computational complexity, dining philosophers 
algorithms, graph isomorphism, hashing, interactive probabilistic proof systems, leader election, message rou 
neighbors problem, perfect hashing, primality testing, probabilistic techniques, randomized or probabilistic alg 
quicksort, sequential algorithms, transitive tournaments, universal hashing 



13 A language-based approach to protocol implementation
Mark B. Abbott, Larry L. Peterson 

February 1993 IEEE/ACM Transactions on Networking (TON), Volume 1 issue 1

Full text available: pdf  Additional Information: full citation, references, citings, index terms, review



14 A Video Compression ...

D. Rizzo, O. Colavin 

March 2002 Proceedings of the conference on Design, automation and test in Europe 

Full text available: pdf(347.52 KB)  Publisher Site  Additional Information: full citation, abstract



In this paper, we investigate the benefits of a flexible, application-specific instruction set by adding a run-time
Functional Unit (RFU) to a VLIW processor. Preliminary results on the motion estimation stage in an MPEG4 vid
presented. With the RFU modeled at functional level and under realistic assumptions on execution latency, tech
and reconfiguration penalty, we explore different RFU instructions at fine-grain (instruction-level) and coarse-g
David W. Wall

May 1992 ACM Transactions on Programming Languages and Systems (TOPLAS), volume 14 issue 3
Full text available: pdf(2.36 MB)  Additional Information: full citation, abstract, references, citings, index terms, review

We have built a system in which the compiler back end and the linker work together to present an abstract m 
considerably higher level than the actual machine. The intermediate language translated by the back end is th 
all high-level compilers and is also the only assembly language generally available. This lets us do intermodul 
which would be harder if some of the code in the program had come from a traditional assembler, out of sigh 

Keywords: RISC, graph coloring, intermediate language, interprocedural, optimization, pipeline scheduling, p 
allocation, register windows 



16 Code optimization - I: Optimizing memory accesses for spatial computation
Mihai Budiu, Seth C. Goldstein 

March 2003 Proceedings of the international symposium on Code generation and optimization: feed 
runtime optimization 

Full text available: pdf  Publisher Site  Additional Information: full citation, abstract, references, index terms

In this paper we present the internal representation and optimizations used by the CASH compiler for improvi
parallelism of pointer-based programs. CASH uses an SSA-based representation for memory, which compactl
control-flow and dependence information. In CASH, memory optimization is a four-step process: (1) first an in
representation of memory dependences is built; (2) next, unnecessary memory dependences are removed us 



17 Programming languages for distributed computing systems

Henri E. Bal, Jennifer G. Steiner, Andrew S. Tanenbaum 

September 1989 ACM Computing Surveys (CSUR), volume 21 issue 3 

Full text available: pdf(6.50 MB)  Additional Information: full citation, abstract, references, citings, index terms, review

When distributed systems first appeared, they were programmed in traditional sequential languages, usually 
few library procedures for sending and receiving messages. As distributed applications became more common 
sophisticated, this ad hoc approach became less satisfactory. Researchers all over the world began designing 
languages specifically for implementing distributed applications. These languages and their history, their unde 



18 Measurement and evaluation of the MIPS architecture and processor

Thomas R. Gross, John L. Hennessy, Steven A. Przybylski, Christopher Rowen
August 1988 ACM Transactions on Computer Systems (TOCS), volume 6 issue 3

Full text available: pdf(2.30 MB)  Additional Information: full citation, abstract, references, citings, index terms, review

MIPS is a 32-bit processor architecture that has been implemented as an nMOS VLSI chip. The instruction set 
based. Close coupling with compilers and efficient use of the instruction set by compiled programs were goals 
The MIPS architecture requires that the software implement some constraints in the design that are normally 
hardware implementation. This paper presents experimental results on the effectiveness of this processor ... 



19 A language-based approach to protocol implementation
Mark B. Abbott, Larry L. Peterson 

October 1992 ACM SIGCOMM Computer Communication Review, Conference proceedings on Commun
architectures & protocols, volume 22 issue 4
Full text available: pdf(1.28 MB)  Additional Information: full citation, abstract, references, citings, index terms

Morpheus is a special-purpose programming language that facilitates the efficient implementation of communic
Protocols are divided into three categories, called shapes, so that they can inherit code and data structures ba
the programmer implements a particular protocol by refining the inherited structure. Morpheus optimization t 
layer overhead on time-critical operations to a few assembler instructions even though the ... 

20 Practical data breakpoints: design and implementation
Robert Wahbe, Steven Lucco, Susan L. Graham 

June 1993 ACM SIGPLAN Notices, Proceedings of the ACM SIGPLAN 1993 conference on Programm
design and implementation, volume 28 issue 6

Full text available: pdf  Additional Information: full citation, abstract, references, citings, index terms

A data breakpoint associates debugging actions with programmer-specified conditions on the memory state o
program. Data breakpoints provide a means for discovering program bugs that are tedious or impossible to is
breakpoints alone. In practice, programmers rarely use data breakpoints, because they are either unimpleme
slow in available debugging software. In this paper, we present the design and implementation of a practical